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Abstract. We study CMV matrices (a discrete one-dimensional Dirac-type 
operator) with random decaying coefficients. Under mild assumptions we iden- 
tify the local eigenvalue statistics in the natural scaling limit. For rapidly de- 
creasing coefficients, the eigenvalues have rigid spacing (like the numerals on a 
clock); in the case of slow decrease, the eigenvalues are distributed according 
to a Poisson process. For a certain critical rate of decay we obtain the circular 



the square of the coupling constant. 



1. Introduction 

The bulk spectral properties of Schrodinger operators and Jacobi matrices with 
random decaying potentials are fairly well understood; see p] |B1 GHl ED IS] • In 
particular, it has been discovered that there is a change in spectral type at certain 
critical rate of decay. It is natural to investigate the finer spectral properties; for 
example, the distribution of eigenvalue spacings. In order to speak of eigenvalue 
spacings, one first restricts the operator to a finite volume and then studies the 
behaviour of the rescaled spacings in the infinite volume limit. Such an investigation 
is the goal of this paper; however we treat neither Schrodinger nor Jacobi operators, 
but rather CMV matrices: 

Definition 1.1. Given an infinite sequence of coefficients ao,ai, . . . in D (the open 
unit disk in C), we define 2x2 matrices 



OLk Pk 

Pk -OL k 



where p k = yl — |afc| 2 - From these, we form block-diagonal matrices 

C = diag(S ,S 2 ,S 4 , . . .) and M = diag(S_ x , H x , H 3 , . . .), 

where S_x = [1], the lxl identity matrix. The CMV matrix associated to the 
coefficients ao, a±, . . . is C = C(«o, Qi, . . .) = CM.. 

The name CMV is taken from from the initials of the authors of 4 , following 
recent custom; however as discussed in |38l 145) . they appeared earlier in several 
places. A thorough discussion of these operators can be found in |85l 

CMV matrices are unitary and may naturally be considered a discrete form 
of one-dimensional Dirac operator. There are several reasons for our choosing to 
consider these operators: they are bounded, discrete, and most importantly, possess 
a rotational symmetry that we will discuss in due course. 
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The restriction of a CMV matrix to a finite interval is not simply the top left 
corner of the original matrix — this would result in a non-unitary operator — one 
needs to add a boundary condition. If one replaces a n —i by e tr) £ S 1 (the unit 
circle in C), then C decomposes into a direct sum; specifically, the top left n x n 
block decouples from the remainder of the matrix. We define the n x n truncation 
of C with 77 boundary condition in this way and write 

= C(a ,...,a n - 2 ,e i *). 

We wish to consider the model where e lr> and each a k is chosen independently 
(though not identically distributed) with a rotationally invariant distribution, that 
is, 

E{g(e"r)Y[f k (e^a k )} = ^ Y[E{f k (a k )} 

for any finite collection of f k ■ © — > C and 9 k £ R. The reason for considering only 
rotationally invariant distributions is that in this case the spectral properties are 
homogeneous under rotations of the circle. For example, for any 8 £ [0,27r), 

e- i6 C(a , ai, . . . , a„_ 2 , e">) « C(e l9 a , e 2i %, e l ^ e a n ^ e ir > +in6 ), (1) 

which is to say that the two matrices have the same eigenvalues that is, they are 
unitarily equivalent (cf. |351 p. 81]). 

For power-decaying a k , the bulk spectral properties for this model were worked 
out in [361 §12.7], following the lines of earlier arguments from the Jacobi and 
Schrodinger cases. Let us now describe the finer behaviour we wish to investigate. 

Associated to the random n x n matrix C^™' are n random eigenvalues, which 
we denote e lSl , . . . , e l9n , with 9j £ {—ir, n] . By rotation invariance, QJ, the average 
number of eigenvalues with 0j £ [a, b) is n. For this reason it is natural to rescale 
the eigenvalues by a factor of n. Our goal is to show that under natural assump- 
tions, the eigenvalue process, that is, the random set of points {n9\, . . . , n9 n } C R, 
converge (in an appropriate sense) to a point process and to understand the limiting 
process as best we can. 

We will prove the convergence of the expected values of certain random variables 
that are rich enough to characterize the point process: 

Definition 1.2. Given a non-negative / £ C£°(R), we define a random variable 
on a point process by 

X f = exp(-J2f( Xj )) 

where Xj are the points of the process. E{Xj} are known as the Laplace functionals 
of the process. 

One may also characterize a point process in terms of the probability of certain 
events: 

Definition 1.3. Given p disjoint intervals [aj,bj), and p non-negative integers 
ki, . . . , k p , we define the event 

n([ai, h), . . . , [a p , b p ), ki, . . . , k p ) 

as the occurrence of exactly kj points in the interval [aj, bj) for each 1 < j < p. 
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Convergence of the probabilities of these events is fractionally stronger than 
that of the random variables Xf, indeed they are equivalent notions if no location 
is occupied by the process with positive probability (cf. pH] §3-5]). We will use the 
former test for convergence as it is the most convenient for us. 

Three processes will arise as limits of the eigenvalue process; we will now define 
these. 

Definition 1.4. The Poisson process with intensity dv is characterized by 
E{X f } = expjy [e- f{x) - l] du[x) 



or more simply by 



»{fi([o 1 ,6i),... > [a I „6p), = i[ 



K< A 



kA 

where Xj = v([a,j,bj)). (This says that the number of points in disjoint intervals 
are independent Poisson random variables.) 

Definition 1.5. The clock (or picket fence) process with spacing h is characterized 

by 

E i X f} = ll cxp{-J2 keZ f(hk + x)}dx. 

That is, points occur a random translate of hZ. 

The third limiting process we need to describe is considerably more complex. 
We begin with the C/3E„, that is, the circular (3 ensemble with n points. This is 
a point process on the unit circle. The total number of points is n (non-random) 
and they are distributed so that 

K(f) = /; ■■/[ /(e*S • • . ,e»-)|A(e«V . .,e^)f § - (2) 

for any symmetric function /. The partition function is given by 

r(±3n + l) 

n ' p ~ [r(i/3 + i)]" ; (3) 

see El I46j . This is a standard family of ensembles in random matrix theory 
and was introduced by Dyson, [T2]. This ensemble also arises as the Gibb's measure 
for n identical charged particles confined to lie on the circle and interacting via the 
2-dimensional Coulomb law. For this reason, it is also known as the log-gas. 

Just as we discussed earlier for eigenvalues of random CMV matrices, one may 
take the points of the C/3E„ and by unwrapping and rescaling obtain a sequence of 
point processes on the line. It is not obvious that this sequence of point processes 
converges. One off-shoot of the work presented here is that they do (indeed, we 
will obtain a new representation of the limiting process). 

Definition 1.6. The C/3E process is characterized by 

E{X f } = lhn E^{exp{-£ 3 f{n6 3 )}}. 

with the notation of (This limit does indeed exist.) 
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It is not difficult to show that (3 [ limit of C/?E„ converges to a Poisson process 
as n — » oo and that the (3 j oo limit converges a clock process. 

For (3 € {1,2,4}, the C/3E process is very well understood; indeed there are 
explicit formulae for the correlation functions and for the inter-particle spacing 
distributions. Discussion of this work can scarcely be contained in a single mono- 
graph and we have no intention of trying to review it here; the unacquainted should 
probably begin with [2E] or |14| . 

The content of this paper can be best summarized as 

Theorem 1.7. (i) If E{|a/c| 2 } = o(fc _1 ), then the eigenvalue process converges to 
the clock process with spacing 2n as n — > oo . 

(ii) //E{K| 2 } - = oik- 1 ) and E{log 2 [l- |a fc | 2 ]} = o^" 1 ), then the 
eigenvalue process converges to the C/3E process as n — > oo . 

(iii) IfE{\a k \ 2 } > (k + ly- 1 for some e > and E{(1 - \a k \ 2 y s } = 0(1) for 
some s > 0, then the eigenvalue process converges to the Poisson process with 
intensity %zdx as n — > oo. 

This theorem demonstrates that (at least for this model) the transition from 
Poisson to Clock statistics is actually continuous. This moots the question of 
the eigenvalue statistics in a (microscopic) neighbourhood of the (conjectured) 
metal/insulator transition in the multi-dimensional Anderson model. One may also 
ponder the eigenvalue statistics for less random models, for example, the Almost- 
Mathieu operator, which is also known to have such a transition. Unfortunately, 
we have nothing to say on these interesting topics. 

As log[l — r 2 ] = r 2 + 0(r 4 ) as r J. 0, it is natural to consider replacing the second 
hypothesis in part (ii) by E{|afe| 4 } = o(/c _1 ) or perhaps discarding it altogether. 
The first option is conceivable, but would increase the complexity of the proof more 
than it warrants. This second option is certainly not possible. An a k lying very 
close to the unit circle effectively of decouples the CMV matrix into a direct sum. 
For example, if one considers 

\ak\ = 1 — e~ fc! with probability ~ ^r, 

and zero otherwise, then the eigenvalue process becomes a superposition of clock 
processes with random spacings. This has too little eigenvalue repulsion to be 
consistent with C/3E, at least for (3 large. Similarly, the second hypothesis in part 
(iii) is surely an artifact of our proof. 

The proof of part (i) is the most elementary of the three and is contained in 
Section [3] using results from Section [21 A discussion of non-random results of this 
type (with stronger decay assumptions) can be found in 33 §4] ; their proofs occupy 
the subsequent sections of that paper. 

Part (iii) of Theorem 11.71 is proved in Section [S] using methods from the Ph.D. 
thesis of the second author, [411 142| . This in turn is derived from work of Minami, 
|29|. for the Anderson model. 

Part (ii) is proved by combining two ingredients. The first is an exactly soluble 
example of our model that was discovered in |23) . The second is an invariance 
principle that shows that the limiting eigenvalue statistics do not depend on the 
specifics of the probability distribution of the Verblunsky parameters. 

Several results of this genre have been proved, though only for systems which 
lead to (3 € {1, 2, 4}. Such results are reviewed in To our eyes, the most similar 
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results of this type can be found in |2()l 140) . which study the Wigner ensemble of 
Hermitian/ Symmetric matrices with independent entries. 

We now describe the exactly soluble model alluded to above. 

Definition 1.8. A complex random variable, X, with values in the unit disk, ID, 
is 0„-distributed (for v > 1) if 

E{f(X)} = [f f(z)(l - |z| 2 )^ 2 Sz. (4) 

J JD 

For v > 2 an integer, this has the following geometric interpretation: If v is chosen 
at random from the unit sphere S u in W +l according to the usual surface measure, 
then v\ + iv2 is 0„-distributed. 

An elementary computation shows that if X ~ 0„, then 

E{|X| 2 } = 4 T . (5) 

The combined content of Theorem 1.2 and Proposition B.2 from is: 

Theorem 1.9. Given (3 > 0, let ctk ~ 0/3(fc+i)+i be independent random variables. 
Then the eigenvalues of are distributed according to the C(3E n ensemble. 

Note that (22j and its precursor, ^U], contain similar results that could be used 
in a parallel investigation of the Jacobi matrix case. The bulk spectral properties 
of one such model are discussed from a random-matrix perspective in |39| . 

The bulk of the proof of Theorem I1.7f ii1 appears in Section 0] although it also 
relies on material from Section [21 

Next we present our new representation of the C/3E process. The set of critical 
values of any non-decreasing function if> : K — > R (i.e., values taken at more than 
one point) is at most countable; for otherwise one could find uncountably many 
disjoint open subsets of R. Therefore, for all but countably many v € (inf ^sup^) 
we may uniquely define x = tp (v) as the point where ip passes through (or jumps 
past) the value v. 

Definition 1.10. Given a probability measure on non-decreasing functions ip :R — > 
R, with expectation E^, we define an associated point process by 



E{X f } = E i , [ cxp(- V fo V^ 1 (27rm + cj)ldw, 
that is, points are placed at the inverse images of a random translate of 27rZ. 



(6) 



For example, the picket fence process is associated to ijj(x) — x (non-random), 
while a Poisson point process arises when ip is chosen to be (2ir times) the Poisson 
jump process. In general, the average number of points in the interval [a, b) is 
Ety(fc-) - V(o-)}- 

The random non-decreasing function that is relevant for us is the relative Priifer 
phase. This is described precisely in Sectional however, we would like to give some 
of the flavour of this here. 

The notion of Priifer phase originates in the study of Sturm-Liouville problems, 
see for example, 5, Ch. 8]. Given q 6 L 1 , let us define u(x-.E) as the solution of 

-u"(x) + (q(x) - E)u(x) = 
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on [0,L] that obeys u(0) = and u'(0) = 1. The Priifer phase ip(x;E) and ampli- 
tude R(x; E) are defined by 

u(x; E) = R(x; E) sin(^(;r; E)), u'{x; E) = R(x; E) cos(^(x; E)) 

and -0(0; E) = 0. Notice that if we place a self-adjoint boundary condition u' sin(r7) — 
u cos(r]) — at x — L, then the eigenvalues of the Sturm-Liouville problem are the 
energies where ip(L; E) — r\. Note also that ijj(L, E) is a strictly increasing function 
of E; this is essentially the Sturm Oscillation Theorem. 

All the properties just described carry over to the natural CMV analogue of 
Priifer variables (see SectionElor jSEl §10.12]). In the microscopic scaling we con- 
sider, the key quantity is the Priifer phase relative to that at a reference energy. 
Because our model has rotational symmetry we can choose the reference energy 
to be z = 1 without loss of generality. In this way, we will find a natural process 
of increasing functions ipu : (— 7r,7r) — > R so that the eigenvalues of C^ n ' lie at the 
angles where ip n -i matches that random boundary condition r\. Actually, that is a 
lie since ip is merely the relative Priifer phase; nevertheless, because of the way we 
choose our boundary condition, it is true that the eigenvalues of are related to 
the random increasing function ip n -i in the sense of Definition II. 1UI 

The key to proving both that the C/3E„ processes converge and part (ii) of 
Theorem II .71 is showing that in the natural scaling, ^ n (x) = ip n -i(^-), the relative 
Priifer phase process converges as n — > 00. As we will see in SectionQJ this is indeed 
the case; moreover we find an alternate realization of the limiting distribution, which 
we now describe. 

Let Bi(t) and B2(t) be independent Brownian motions. From Proposition 14.51 
it follows that for each x, the stochastic differential equation 

= x dt + -|= Im { [e l * - l] [dB 1 (t) + idB 2 (t)] } (7) 

has a unique solution fy(t;x) that obeys M{^{t;x)} = xt and x^{t\x) > 0. We 
will prove that for any finite collection xi, . . . , x p the vector- valued random variable 
^nix^/n) converges in distribution to ^>(t — 1; x^). By Proposition ^. 51 this notion 
of convergence is strong enough to imply that the Laplace functional of the eigen- 
value process converge to the point process associated to the random increasing 
function ^>{t = l,x). 

While we where typing the last section of this work, two preprints, |KSI I32|. 
appeared on the arXiv. These works (independent of our own) discuss matters 
related to our proof of part (ii) of Theorem 1 1.71 The latter preprint discusses the 
edge statistics for the Gaussian j3 ensembles in the n — > 00 limit. Specifically, they 
show that the in the (naturally re-scaled) limit, the point process can be modelled 
as the eigenvalues of a half-line Schrodinger operator with stochastic potential. 
Our representation of the limiting C/3E ensemble has an analogous interpretation: 
Equation (JTJ) describes the relative Priifer phase for the Krein system with 'poten- 
tial' A(x) = (/3a;) _1 / 2 [o?_B3(i) + idB^t)}, where B^(x) and B^(x) are independent 
Brownian motions. In this way, we may say that the C/3E ensemble is modelled by 
the eigenvalues of this Krem system with random self-adjoint boundary condition 
at x = 1. (For a pedagogical account of the theory of Krem systems, we recommend 
[Hj.) The remarks in the final section of JHj seem to indicate that the authors were 
seeking a result of this type. 
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Notation. We will write X < Y to indicate that X < CY for some constant 
C. Typically, this is an absolute numerical constant whose exact value is of no 
consequence. When it depends some parameters, we will endeavour to point this 
out. 

Acknowledgements. We would like to thank Marek Biskup for some probabilis- 
tic pointers. We would also like to thank Tom Spencer and Jinho Baik for their 
encouragement to pursue this line of investigation. R. K. was supported in part, 
by NSF grant DMS-0401277 and a Sloan Foundation Fellowship. 

2. The basic process 

The theory of CMV matrices is intimately connected to the theory of orthogonal 
polynomials on the unit circle, just as Jacobi matrices are inherently tied to poly- 
nomials orthogonal with respect to a probability measure on the real line. We will 
confine ourselves to a review of the material we need; a more systematic treatment 
can be found in |rS5| . 

Let d[i denote the spectral measure associated to C (a semi-infinite CMV matrix) 
and the vector e\ = (1, 0, 0, . . .). As C is unitary and e\ is a unit vector, this is a 
probability measure on the unit circle in C. Applying the Gram-Schmidt procedure 
to {l,z,z 2 , . . .} leads to a sequence of monic orthogonal polynomials. We write 
<f>k(z) for the polynomial of degree k. 

Two important properties carry over from the Jacobi matrix case: Firstly, the 
orthogonal polynomials obey a recurrence relation: 

$ fe +i(z) = z$ k (z) - a k $%(z) 

$fc+iO) = K(z) ~ a k z^ k (z) lS ' 
with $5 = ^0 — 1- Here $* k denotes the reversed polynomial, 



*;(*) = **$*(*-!). (9) 

Secondly, the characteristic polynomial of finite-volume truncations can be ex- 
pressed in terms of these monic polynomials; specifically, 

det[* - C< n >] - «$„_i(z) - e-^SJUC*). ( 10 ) 

The CMV analogue of the Weyl m-function used in connection with Jacobi 
matrices is the Caratheodory function, 

F(z) = [ MO) = (ei\(C + z)(C- z)-^), (11) 



which will be important in our treatment of part (iii) of Theorem 11.71 The other 
two parts of that theorem rely on analysis of the relative Priifer phase, which we 
will now describe. 

Let Bk{z) = z$k(z)/$l(z). In view of ©, this is a finite Blaschke product; 
moreover, by JSJ, 

B k+1 (z) = zB k (z)- —— . (12) 

1 - a k B k {z) 

Note also that Bq(z) = z. It is natural to define the (absolute) Priifer phase as 
the argument of Bk(z). By the Cauchy-Riemann equations, the argument of a 
finite Blaschke product is strictly increasing and by l|l(J|) . the eigenvalues of C^ n ' 
are precisely the points where B n -\{z) = e tn . 
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The relative Priifcr phase is the sequence of random continuous increasing func- 
tions ipk ■ (— 7T, 7r) — > M dchned by 

and ^fc(O) = 0. Naturally, this sequence also obeys a recurrence relation. The 
relation is simpler when expressed in terms of a new sequence of random variables 
7fe in place of the Verblunsky coefficients. Because we have chosen to study the 
rotationally invariant problem, the two systems of coefficients follow the same law: 

Lemma 2.1. For each n, the random variables 

lk = B k (l)a k , fc = 0,l,2,...,n-2, and e^ = B n _ 1 (l)e i " 

are statistically independent. Moreover, they have the same joint distribution as 
a>o,..., a n ~2 and e %n . 

Proof. One merely needs to make two observations. First, the arguments of a k and 
e %r) are all statistically independent of one another and of the moduli of the the a k ■ 
Second, the arguments are all uniformly distributed on the circle. □ 

Proposition 2.2. As defined above, ip k {9) obeys the following recurrence 

1-7* 



i>k+i =^fc + + 2Imlog 



1 - lk e^x 



= 0, (13) 



where the branch of log is chosen so as to give when ^ k = 0, that is, by the 
convergent power series 

T^, 7 ):=2Imlog[ T ^] = Im § [e«* - lfr* (14) 

e=i 

Moreover, the eigenvalues of C n are associated to ^ n _i(0) in the sense of Defini- 
tion M.llA 

Proof. The first claim is an elementary computation: from l|12|l . 

B k+1 (e ie ) _ %e B k (e l8 ) 1 - a k B k (l) 1 - a k B k (e w ) 
B k+1 (l) 6 S fc (l) 1 - a k B k (l) 1 - a k B k (e ie ) 1 ' 

and so 

1-7* 1 - 7*e^ fc 

To finish the proof of i|13|) . one should note that C/C = exp{2i Imlog(£)} for any 

CeC\{0}. 

To prove the second claim, we note that by l|l(J|l . the eigenvalues of are the 
points e 10 where e %r] B n _\(e %e ) — 1 or equivalently where ip n _ i(9) — ui E 2ttZ. Recall 
that e~ %UJ = B„_i(l)e lr ', which is statistically independent of ^„_i((9). □ 

Warning. Despite the fact that z — 1 is an eigenvalue of if and only if u> = 0, 
V>n-i is not the natural the conditioned process. Specifically, let / be a continuous 
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E{e- tlf ^\lea(C^)} 



:UmE{e- tr ^ c<n) )| ( r(CW)ne i [ o ' e ]^0} 

■ IJn £ E { jf ""^ exp / ° exp(#-l 1 (27rm + W ))] & 

= E{i<_ 1 (0) exp[-£ m / o exp(#-^(27rm)) 



Hence the probability distribution picks up an additional weight, —ip' n —i(0)- It is 
not difficult to derive 



-2 k 



ti(o) = i+EII Re i 



1+7; 



11 



fc=0 2=0 
from 

We will use the following properties of the function T 
Proposition 2.3. Given < r < 1 and 0, ?/> S R, 



2tt 



/■27T 

/ | T (^re«>)| 2 f <^|V 
Jo 



2 \(^r e ' ie )T(0,re l9 )^ 

Z7T 



< 2 



|0|) log[— 



(17) 
(18) 
(19) 



2r 2 Re{[e^-l][e-^-l]}- f * T(^, re i9 )T(</>, re lS ) £ 

Jo 

<2(|^| + |0|)r 2 log[ T ^ 

and also 

Mj^r]" ^-<Wmog 2 [l-r 2 ]. 
Jo z7r 

Lastly, interpolating between <|19|) and (|21|l gives 



\Y(ip, re 2 



,3 d0 



2^ 



<Mlog 3/2 [l^]" 



(20) 



(21) 



(22) 



Proof. Equation (|17|l follows immediately from the mean value theorem for har- 
monic functions. 
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For the second, third, and fourth claims, we apply Parseval's identity to see that 

de 



= E i F i /lm{[e^-l]e« e }lm{[ e 

oo 

= ^2^Re{[ e ^-l][e-^-l]}. 



1]e uo } f 



Inequality JTSJ follows by bounding this by 8r 2 Y^£ 2 ■ As \e l9 — 1| < |0|, we have 

|Re{[e^ - l][e- tf * - 1]}| < £\^\ + i\<f>\. 

To deduce (|19|l , we simply substitute this estimate into the identity above and then 
sum the resulting series. The inequality 1|2U|) follows from the identity above by 
similar reasoning. 

We now turn to H21fl . Applying Parseval's identity to the square of l|14f) gives 



f 1 r 1-re" 1 4 CIO ^ 
J ° q=2 



,2 9 



9-1 



l][e 



(23) 



We may estimate the sum over £ in each of two ways: First, one may bound each 
modulus of each term in square brackets by 2. Second, as the sum is symmetrical 
under £ > (q — £), we may bound it by 



L«/2J 



L?/2J 



«=1 £=1 
Using each of these estimates for one factor of the sum in H23[l gives 

oo q— 1 

LHSO < 161-0] E r 29 Y, T(h) = 16 ^l lo S 2 [ i^] ' 

q=2 1=1 

which completes the proof of the proposition. 

Corollary 2.4. For any rotationally invariant system of parameters, 
E{M&)} = is + 1)9, n\^s{0)\\ = (s+ 1)\9\, 

and for k < m, 



E{hM^)-^(f 



X \ x(m—k) |2 



}= 5>{|T(^), 7s )| 2 }. 



(24) 
□ 

(25) 
(26) 



Proof. For each s, ip s (9) is an increasing function, which vanishes at 9 — 0. Thus 
ip s (9) takes only non- negative values for 6 > and non-positive values for # < 0. For 
this reason, the second claim follows from the first. Both the estimates for the mean 
and the variance follow quickly from the recurrence relation, (|13[1 . and (|17[l . □ 

As outlined in the introduction, we will deduce convergence of the point processes 
from that of the relative Priifer phase. The next result shows how this can be done. 
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Proposition 2.5. Suppose there are processes ^ n {x), n £ Z and ty(x) of non- 
decreasing functions with two properties: (i) for all x £ R, 

lim E{|¥ n (aO|} = E{|*(a;)|} < oo (27) 

n — >oo 

and (ii) for any finite collection of Xj € R, the joint distribution of i S n (xj) converges 
to that of^(xj). Then the point processes associated to ^ n converge in distribution 
to that associated to '5 . 

Proof. The probabilistic portion of the proof is quite simple once we have the right 
deterministic information. To this end, let and $ be non-decreasing functions on 
R and let / G C£°(R) be non-negative. Elementary considerations show that 



[ exp(- ^2 f ° * _1 (27rm + - exp(- / o ^ 1 {2mn + lu)\ 



m£Z 



7rm + w) — / o \]> 1 (27rm + cj) I 



< 



(28) 
(29) 
(30) 



Next, we choose [a, b] C R so that supp(/) C [a, b}. Then 
< \x~y\ ■ where z = 



x < a 
x 6 [a, 6] 
x > 6 



and similarly for y. Therefore, 

\f o®- 1 ^) - f o^^lduj <\\f\\ L , 



= II/'IU- f Mx)-V(x)\dx. 

J a 



(31) 



(32) 



The equality follows by realizing that both integrals give the area of the region 
between the graphs of $ and \1/ over the interval [a, b] (with vertical lines drawn at 
any jumps). 

Combining our efforts so far, we have bounded the expression in (|28|l by that 
in (|32|l . We will make one further reduction. Given an integer p > 1, let us partition 
[a, 6] into p equal intervals by introducing Xj = a + j(b — q)/p with < j < p. As 
both $ and \& are non-decreasing, 

' J+ \®(x)-V{x)\dx (33) 



< ^ [max 



(34) 



< - ${ Xj )\ + \Hx j+1 ) - 9(x j+1 )\ + \*( Xj ) - *( Xj )\], (35) 

which shows that 

f |*(s) - *(x)\dx < 6=2|*(6) - *(a)| + 2^ £ |$(x,) - (36) 

•'a ,•— n 



3=0 



Let us now begin the probabilistic portion of the argument. We will think of 
^ n (xj) and &(xj) as vector- valued random variables where < j < p indexes the 
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components. By the Skorohod coupling, we can realize the random variables ^(xj) 
and all ^> n {xj) on a single probability space so that ^ n (xj) — ► &(xj) almost-surely. 
Using hypothesis (|27[) . we can up-grade this to 

lim E{|*„(a:j)-*(a:i)|} =0, 

n— >00 

which by virtue of (|36|l . implies 
,6 

lim sup / |*„(a;) -V(x)\dx < ^E{|*(6) -*(a)|} (37) 

for any positive integer p > 0. Tracing back the deductions made earlier, we can 
conclude convergence of the Laplace functionals. □ 

Remark: Once one knows that the distributions of ^ n (xj) converge for any finite 
set of Xj, the existence of a limiting process, ^S(x), follows from the Kolmogorov 
extension theorem. 

3. Fast decay 
Theorem 3.1. If E{|a s | 2 } = o(s" 1 ) then Vn-i(f) 

— * x in L -sense for each 
x € [0, oo). This implies that locally the eigenvalues follow clock statistics. 

Proof. As E{| 7s | 2 } = o(s- 1 ), 

J2 E{|%| 2 }=o(l) 

n>s>en 

as n — > oo for any choice of e. Therefore, there is a sequence k n that is o{n) and 
obeys 

n 

]TE{| 7s | 2 }=o(l) 
as n — > oo. Combining this with l|2t)|) and l|18f) we have, 

n-2 

E{|V„-i(*) - Vfc„(f ) - (" - 1 - *n)f I 2 } < ^ £ E {l^| 2 > = (!)- 

s—k n 

Now by the triangle inequality and the Cauchy-Schwarz inequality, 

E{|V„-i(^) - <c|} < E{|^„(f ) - (K + + o(l) 
<2(fe„ + l)M +o( i) ; 

which is o(l) by the way we chose k n . Note that the second inequality follows 
from 1(25)1, 

Convergence of the point processes follows from Proposition 12. 51 □ 

Remark. IfE{|a fc | 2 } e £ l , the n one obtains clock spacing on an almost macroscopic 
scale. Indeed for any (non-random) sequence x n with \x n \ = o(n), 

E{\^ n -i(x n /n) - X n \) — > 

asn-> oo. The proof follows the outline above, but with k n — > oo chosen so that 
= o(n). 
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4. Critical decay 



Given x € K p with coordinates x M , we define a sequence of Revalued processes 
^ n (t) on [0, oo) as follows: for all 1 < j < p, 

VnA°) = (38) 

*n, M (^) =M%) for0<fc<oo (39) 

and for intermediate values of t, we define ^ n (t) by linear interpolation. The goal 
of this section is to prove that this sequence of processes converge to a limit that 
is independent of the specifics of the distribution of the random variables 7&. Our 
first job will be to prove the existence of even just a subsequential limit of these 
processes. 

Consistent with our goal of proving part (ii) of Theorem ll.7l will we assume for 
the remainder of this section that 

E{l7 fc | 2 } = ^ry(l 



E 



{log 



3/2 



1-17*1* 



< 



< 



(FPT) efe 



(40) 
(41) 
(42) 



for some decreasing sequence of positive numbers tk converging to zero. Note that 
while (|42|) does not appear among the assumptions of this part of the Theorem, it 
is easily deduced from them. Similarly, 



E{log[ T ^ rF ]} 



< 



_i-i7)=i 2 . r ~ i'.- i-n • ( 43 ) 

Here and for the remainder of this section the implicit constants hidden in the < 
notation are permitted to depend on (3, the properties of the sequence and on 
the values of x under consideration; they will never depend on n. 

Before proceeding, let us pause to demonstrate that the exactly soluble model 
of Theorem II . 91 obevs the hypotheses given above. If a ~ 8^, then 



E { log" 



} = ^j t m e-^- 1 ^ 2 dt = T(m + l)^) , 

by making the change of variables e~* = 1 — \a\ 2 . 
Lemma 4.1. For fixed x, 

E{I<M£) - Mt: ) - x -^\ 2 } < (44) 

In particular, from k — we have E{|'0 S (^)| 2 } < — + {^-) 2 ■ Furthermore, 
P{ sup |^ m (f)-^(^)-^|> e }< e - 4 (^log[fM]+nL_ie fe ) (45) 



-fe<s< 

Proof. Let k be fixed and define 



x \ x(s — fc) 



for s > k. Note that <j> s +i = 4> s + T('0 s ,7s)- Equation (|17|) shows that this is a 
martingale; therefore, using (|19fl . (|43(l and l|25|) 



m — 1 



< m—k 



s—k 



s—k 
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This proves (|44|l : we now consider Ij45(l . By (|17|l . 

E{|0 S+1 | 4 } =E{|0 s | 4 }+6E{|^| 2 |T(^, 7s )| 2 } 

+ 4E{> s T(V> s , 7s ) 3 } +E{|T(^, 7s )| 4 }. 

Using (HHJ) and JSJ|, 

E{|0 s | 2 |T(^, 7s )| 2 } < E{|0 s | 2 | 7s | 2 } < 
while by 123, 123) and g3, 

E{|0 s ||T(V> s , 7s )| 3 } < E{[|^| + m + {s- fc)f ] log 3 / 2 



(46) 



~ n s+l ts ' 



Similarly using J2U and (|4*T|l . 

E{|T(^, 7s )| 4 }<^^Te 5 . 
By substituting the last three estimates into l|46|l . we obtain 



E{|^ s+1 | 4 }-E{|^| 4 } 



< s-*-' 1 



71 s+1 1 n s 



By summing the preceding estimate over s and using the fact that <j>k = 0, we 
obtain 



E{|^ 4 }<^l g[^]+H^ efe . 
Thus (|45|l follows by the martingale maximal theorem. 



□ 



Proposition 4.2. For fixed x, the processes ^ n (t) define a tight family of measures 
on C([0,oo);RP). 

Proof. We will prove that for fixed x and T > 1, 
limsupP(2e < sup{|^(f ) < s < t <nT and |s - t\ < nS}) = o(l) 

as 5 i 0. The Ascoli-Arzela theorem shows that this suffices; cf. 031 §1-4]. 

It suffices to consider the case n8 > 2. We wish to break the problem into blocks; 
to this end, let kj = \jnd~\ , that is, jn5 rounded up to an integer. The zeroth block 
is easiest: as ?/> s (-) — (s + 1)- is a martingale, 



c {Mf )l > e for some < s < k,} < e" 1 E{ |^ fel (f )| + (A* + 1)M} 



< <5e 



(47) 



Naturally this bounds the probability that \ip s — ipt\ > 2e for any pair s,i in the 
interval [0, fei]. 

For j > 1, we can apply Lemma 14 . 1 1 with e half the size and so obtain 
P{|V> S (~) - &(*)| > e for some % < s < t < fc i+1 } 
<e- 4 [51og[-f T ]+5e fej +(^) 4 ". 
Because e s decreases to zero, 

J- 1 ^^ J- iy ttj=o{l) 

3=1 3=1 
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as J — * oo uniformly in the region nS > 2. Combining this with log[//(j + l)] < j , 
we obtain 



\T/S] 



hmsup £ V{\M%)-M%) 



> e for some kj < s < t < k 



i+i} 



< 



<51og[l + T5- 1 ] +o(l) + 5 s 



o(l) 



where o(l) means as 5 J. for T fixed. 

Combining the estimate just proved with (|47|l gives the claim made in the first 
paragraph of the proof. Note that the change from e to 2e comes from the fact that 
the points s and t may lie in adjacent blocks rather than the same one. □ 

We now begin to examine the properties of any subsequential limit of 
specifically, we will prove that any such limit is the solution of a certain martingale 
problem. This approach to studying diffusions was introduced by Stroock and 
Varadhan and is very convenient for proving convergence; indeed much of this 
section modelled on 021 §H>2]. 

We can no longer treat the different values of x separately. For this reason, it 
is helpful to introduce some notation for the discrete vector problem. Because the 
equations are going become rather long, we need to make it as compact as possible. 
Let ipg' denote the vector with components 

^ = V s (if ), /i=l,2,...,p. 

Our first lemma shows that this discrete process almost solves a martingale problem. 

Lemma 4.3. Given x and f £ C C °°(R P ), let 



M n (m, k) = ) - xM$? ) + Tnjfaifv*®? ) Re{ [e**& - 1] [e 



1]} 



where subscripts on f denote partial derivatives and /i, v are summed over whenever 
repeated. Then for fixed integer T > 0, 



sup 

0<k<m<Tn 



!{M„(m,fc)-/($f)|^} 







in L 1 sense as n — > oo. 

Proof. The recurrence relation for ip says 



(48) 



Using this we expand fffis+i) m a Taylor series to third order. By incorporating 
(fTTjl we find 



where A s is the sequence of matrices 



(49) 



At" = 



E 



{iT(^, 7s )T(^, 7s )|^'} 



and b s are scalars obeying 

\b s \<^ + su P E{\r(^ ls )\ 3 \^}. 
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(The implicit constant here depends on x and the properties of /; this will be 
the case throughout the proof.) These expressions can be better understood using 
results from Proposition 12. 31 By (|2U|) . we have 



E 



{|T(^, 7s )T(^%7 s )|^} " E{| 7 ,| 2 } Re{[e^- - l][ e -*& - 1]} 



|^+^|E{| 7s | 2 log 
. |^+^|E{log 3/2 
and so by invoking (|25|l and (|42|l we obtain 



< 
< 



i-l7»l 2 



} 



E E { 

s=0 



/3iFFiy Re {[' 



;iu 1 :. 



l][e 



111 

+ ^E e ^°(l) (50) 



s=0 



as n — > oo with T fixed. Similarly by (12211 , we have 



E 



{|T« PS 7.)| 3 |^} < I^|e{ 1o S 3/2 [t4tf]} 



and so by l(IT]l. 



£>{|6 S |} = (1) 



(51) 



s=0 



as n — > oo. 

Combining (|49|) . I|5()|l. and l|51|l. we may conclude that 



ni 

lim Y E { E{M„(s + 1, s) - ) |$* } } = 0, 



(52) 



which is the key estimate for the completion of the proof. 
For any k < m, 



m— 1 



E 



{M„(m, fc) - } = E E { a7 «( s + !' s ) - /($*)|$? } 

S — fe 

m— 1 

£ E{E{M n ( s + 1,8)- /($*) }|^}, 



because the process is Markovian. Therefore, 

nT 



sup E{M„(m,fc)-/(^)|4"'} U^e{ e{M(s + M) -/($?>) |^} }, 

fc<m J s=Q U 



U< 

which converges to by (|5 



□ 



Proposition 4.4. Let x be fixed and let be a subsequential limit of the processes 
Then 

(a) // x^ > x v then $ ^ — vf^ is a non-negative junction with probability one. 
Similarly, for each /i, x^ ^ is a non-negative function with probability one. 

(b) E{||*(£)|| 2 } < t + t 2 for allte [0,oo). 
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(c) E{* M (i)} = x M t for all t e [0, oo). 

(d) For all t > and any f <G C™(W), 

M(t) :=/(*(*))- / i M / p (*(r))dT 



in 



/^(*(r))Re{[ e 1 *^^ - l][e- J *"^) - 1]} 



(53) 



is an Mt-martingale starting at to. Here Mt is the sigma algebra generated by 
^(r) over the interval to < r <t. 

Proof. To avoid unsightly subscripts in what follows, n is restricted throughout to 
lie in the subsequence that converges rather than be an arbitrary positive integer. 

(a) The subset K of functions with * p - > is closed in C([0, oo); W); 
therefore, P( V E' S -ftT) > lira sup P(\& n G i^) = 1. The same argument applies for the 
second statement. 

(b) By taking k — in (|44(l from Lemma f4. II 

E{|^(^)| 2 }=E{|^ s (^)| 2 }<f + (^) 2 . 

One now simply sums over /i and takes n — ► oo. 

(c) By (Pol) , we know that E{^„ iM (i)} = x M i. Moreover by hypothesis, we know 
that E{/(\& n (i))} — > E{/($(t))} for any bounded continuous function /. The result 
now follows by a simple approximation argument using part (b). 

(d) It suffices to show that 

E{G(*\ [t0jtl] )[M(t) - M(h)}} = 

for any bounded continuous function G from C([to, ti]; K p ) to [0,1] and any i > 
£i > io- As this random variable is a continuous function of 4", in the C([to, t]; W p ) 
topology, we need only prove that 

lim E {G(* w | [t0 , tl ]) [M n (t) - M B (ti)] } = 0, (54) 

where M n (t) is the analogue of l|53(l with 'J'n replacing 'J'. This will follow from 
Lemma l4~3l once we relate it back to the discrete process. We will use the notation 



k = |~nii] 

Note that * n (t) = (1 - 6)$% - 



1 and m = \ nt\ — 1. 

for some e [0,1), while * n (*i) is a 



combination of and ■ 

The standard estimate for the trapezoid rule, |3J §15.19], tells us that 



f [<?(<*) + <?(&)] 
for any a, 6 S 



< 



12 



||&-a|HMU° 



g((l — it)a + ub) du 
and any g £ C£°(R P ). Consequently, 

e^ jf s (* n (t))£ft-i f>(#*) } 



< 



< 



n 
n 



n 

lff/^llz. c 



s—k 



(55) 
(56) 
(57) 
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(Here and for the remainder of the proof, implicit constants are permitted to depend 
on t and t\.) The first term in ((56(1 comes from the ends of the interval. To pass to 
(|57Jl . we used l|H)l: 

Hmu-^\\ 2 }<1 

From the inequality immediately above we have 

E{|*„(i) - ^| 2 } +E{|*„(tO - | 2 } < i. 
Combining this with the estimate above for (|55(l . gives 

E{\M n (t) - M n (h) - M n (m, k) + f{^)\} -> (58) 
as n — > oo. In this way, we have reduced the proof to showing that 
limE{G(*„| [t0itl] ) [M n (m, k) - /($»)] } = 0. 

Now G(\E' n |[ to!tl ]) is a measurable function of 70, 71, . . . , 7fc-i, while M(m, k) is 
a function of ip^ , 7^, . . . , 7 m _i. Thus the statement above follows from 

lim E{M n (m,k) - f{^)Wk} = 0, 

n — >oo 

which is a consequence of Lemma 14.31 □ 

Naturally, the key to proving part (ii) of Theorem 11.71 is to show that (some 
subset of) these properties uniquely determines the process ^(t). For this not only 
shows that the limit exists without passing to a subsequence, but also that the 
limit depends only on the assumptions 1(40(1 and 1)41(1. not on the specifics of the 
distribution. Because these assumptions hold for the exactly soluble model given 
in Theorem ll.9l we may also identify the limit as C/3E. 

As described in §8.1], for to > 0, any solution of the martingale problem 1(53(1 
on [to, 00) gives rise to a solution of the system of stochastic differential equations 

dtf M = xy. dt + -fe Im { [e 1 *- - 1] [dBt (t) + idB 2 (t )] } (59) 

with independent Brownian motions B\ and Hi. The restriction to [fo,oo) is nec- 
essary because we only know this result for SDEs with bounded coefficients. As the 
same probability distribution gives a solution of (|59|l for all to > 0, we can speak 
of this as a solution of (|59l) on the open interval (0,oo). 

Proposition 4.5. For each x € W , there is exactly one non- anticipating solution 
of 1(59(1 on (0,oo) that obeys ^{t) > for all t > and 

E{1v(i)} = x M t. (60) 

Moreover, it is a strong solution, that is, "J is measurable with respect to the filtra- 
tion generated by the Brownian motions (cf. (221 §5.2]). 

Proof. Existence of a solution follows from Proposition 14.41 and (431 §8.1] or j^21 
§5.4.B]. To prove the other claims, we will construct a sequence of processes & n (t) 
on the same probability space so that is a measurable (and non-anticipating) 
with respect to the filtration generated by the Brownian motions and so that 

sup{|$„(f) - $(t)\ : n- 1 <t<n}^0 (61) 

in probability as n — > 00. 

By the standard existence and uniqueness theory for SDE with Lipshitz coeffi- 
cients (e.g. 0S1 §5-1] or j22 §5.2.B]), there is a unique solution $„(t) of lf5§|l on 



EIGENVALUE STATISTICS FOR CMV MATRICES 



19 



the interval [n _1 ,oo) with initial data $> n ,ii(n ) = x^n^ 1 ; moreover, $ ra is has the 
measurability properties just described and E{$„ iM (i)} = x^t. 

The process \&(t) — $„(<) is a martingale for t E [n _1 ,oo). By considering 
separately the cases when ^(n -1 ) — $„(n _1 ) is non-negative or non-positive and 
using the fact that solutions of a SDE cannot cross (cf. |23 p. 293]), it follows that 

Using the martingale property again and (HU we obtain 

P{ sup |*( < )_vI,„( < )|> e }<lE{|vI/ Al (n- 1 )| + |$„, Al (n- 1 )|}<-, 

which proves the claim. □ 

Remarks: Elementary manipulations of l|59|l allow one to deduce the following 
properties: 

i) For A > 0, ^f(Xt; x) and Xx) follow the same law. 

ii) For any fixed o£l, ^(x, t)— ^(a, t) and ^(x~ a, t) have the same distribution. 

Theorem 4.6. For any system of parameters obeying E { |a/c| 2 } = ^(fc 2 + i) + o(fc~ 1 ) 

and E{log 2 [l — |«fc| 2 ]} = o(fc _1 ) the eigenvalue process converges in distribution; 
moreover, the limit depends only on (3. 

Proof. By Proposition 14.51 there is a unique process with the properties described 
in Proposition 14.41 Thus all subsequential limits of ^ n (t) are identical and so the 
limit exists without passing to a subsequence. Moreover, the interpretation given in 
Proposition ^. 51 shows that the limiting distributions are consistent under extension 
of the vector x (in the sense of Kolmogorov's Theorem) and so may be regarded as 
the marginals of a probability measure defining a non-decreasing-function-valued 
process, ^(x,t). More importantly, Proposition ^. 51 shows that the distribution of 
the limit, ^(x, t), depends only on /3, not on the specifics of the distribution of the 
Verblunsky coefficients. 

Tracing our definitions back, we see that ?p n -i(x/n) converges in distribution to 
^(x,t — 1) at least for any finite collection points x € R. Therefore, by Propo- 
sition the eigenvalue process converges to that associated to ^(x.t = 1) in 
the sense of Definition 11.101 In view of the exactly soluble example described in 
Theorem II .71 we feel justified in calling this the C/3E process. □ 

5. Slow decay 

In this section we prove part (iii) of Theorem 11.71 That is, we prove that in 
the n — > oo limit, the re-scaled eigenvalues behave as a Poisson point process under 
the assumptions that the Verblunsky coefficients are independent, have rotationally 
invariant laws, 

E{(1-K|T S0 } <1 (62) 

for some sq > 0, and 

E{K| 2 } >(l + ky-\ (63) 

As in the previous sections, the constants implicit in the < notation are permitted 
to depend on the constants in these inequalities, sq and e included. 

Our goal here is to demonstrate the applicability of the methods of which 
prove this result when the Verblunsky coefficients at are identically distributed 
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(and so non-decaying). The approach taken there is in turn derived from the work 
of Minami, [25], on eigenvalue statistics for the Anderson model in the localized 
regime. To give full details of this argument would consume considerable space, 
much of which would more or less reproduce earlier published work with only a few 
minor changes. For this reason, we will be less detailed than might otherwise be 
desirable. 

We believe that under the assumptions set out above, Vn-i(^) converges to 
2ir times a Poisson jump process. Further, it appears that in the natural scaling 
limit with x fixed, the process ipk{-) also converges to (2tt times) a Poisson jump 
process (cf. |H]). It would be nice to deduce these facts directly from the recur- 
rence equation; we have not done this, but hope that someone will pursue these 
matters. This approach to the problem would be closer to the original approach 
to the one dimensional Anderson model, , although estimates on the size of the 
eigenfunctions are used even there. 

The basis of Minami's work is a limit law for a Poisson process — the analogue of 
the convergence of the binomial distribution to a Poisson distribution. Specifically, 
suppose that for each n we have independent point processes drjj, n for 1 < j < n 
and let us form the union, d£„ := • drjj, n . Then g?£„ converges to the Poisson 
process with intensity dv provided 

(a) drjj_ n is a null array, that is, sup.,- ¥{rjj^ n (I) > 0} — > as n ~ * oo, 

( b ) Ej p {%n(-0 > 2} -> as n -> oo, and 

(c) Ej HVjU 1 ) = 1} -» as n ► oo, 

for each compact set /. This result is Corollary 7.5 in |23 and can be found in 
most books on point processes. 

This limit theorem appears here in the following manner. Let us break the 
n x n CMV matrix C^ n > into ~ log[n] blocks of equal length N ~ n/log[n] by 
introducing independent random unitary boundary conditions (just as we did in 
the Introduction in order to obtain annxn matrix from the originally semi-infinite 
one). In this way, we obtain the direct sum of statistically independent matrices, 
so the eigenvalue process is the union of ~ n/log[n] independent point processes. 
This leaves us with two jobs: to check the hypotheses of the theorem above (this 
is quick) and to show that the introduction of these boundary conditions has not 
significantly affected the eigenvalue statistics (this is anything but quick). 

By choosing the boundary conditions uniformly from the circle, the distribution 
of the eigenvalues of each N x N block is rotationally invariant. This resolves the 
hypothesis (a) above and will suffice to justify (c) once we prove (b). To verify (b) 
we invoke the following result from Section 6.1 of |41j . 

Lemma 5.1. Given any CMV matrix of size N x N with Verblunsky coefficients 
that are independent and follow rotationally invariant laws, one has 

P(fi) < ±L 2 N 2 , (64) 

where fi is the event that two or more eigenvalues belong to a fixed arc of length 
2ttL. (Note that the expected number of eigenvalues in this arc is LN .) 

This settles the first of our two jobs. The second is to explain why the in- 
troduction of boundary conditions does not completely rearrange the eigenvalues. 
The reason is simple: the eigenfunctions are very strongly localized with sharply 
decaying tails. Proving this fact is not so simple; however, there is a wealth of 
work on the Anderson model (i.e., for related operators with non-decaying random 
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potentials) that provides guidance. We will be studying fractional moments of the 
Green's function, which is an idea of pQ; while our approach to estimating the 
transfer matrix is remeniscent of |271 . 

For most of the remainder of this section, that is, until Corollarv l5.6l we will be 
dealing with the full semi- infinite CMV matrix C. 

The Szego recurrence for the orthonormal polynomials can be encoded with a 
matrix: 

z -a k 



A, 



Vi -Kl 2 



-a k z 1 



For I > k, we define the transfer matrices 

T(l,k) = Ai- 1 Ai-2---A k+1 A k , 

with T(k, k) = Id. Notice that both A k and T(l, k) depend on z £ C; however we 
suppress this in the notation, since invariably z will be fixed. 

Lemma 5.2. Let a k follow a rotationally invariant law, then 

E{||A fe v||- 1 } < ini^E-fl-lKI 2 } (65) 

for any non-zero vector v £ C 2 and any \z\ = 1. 

Proof. Clearly it suffices to prove this result when v — [x, y) is a unit vector. In 
this little computation shows 

2 1 - ARe(xyza k ) + \a k \ 2 

\ A k v \ = ; i fa ' 

1 - \a k \ z 

The lemma now degenerates to the estimation of an integral. We will use following 
notation: r = \a k \, R = 2\xy\, 9 — azg(xya k ). Note that R,r £ [0, 1], in the former 
case, because \x\ 2 + \y\ 2 = 1. 

de 2 f 2 - VT-^ de_ 

v /l-2i?rcos(6') + r 2 2vr tt J ^1 - 2rcos(6») + r 2 2vr 



llHZ K (^) <l-y. (67) 
tt l + r \ 1+r J 4 K ' 



where K(fc) is the complete elliptic integral of modulus k. The first inequality here 
follows from the fact that this integral is an increasing function of R; to see this 
differentiate and compare the resulting integrand at 9 and 6 + tt. To obtain the 
second inequality with the factor j takes some trouble; however, the fact that it is 
true with some small coefficient follows easily by studying the behaviour as r I 0. 
This simpler fact is actually sufficient for our purposes. □ 

By applying this inductively, we obtain results for the transfer matrices: 

Lemma 5.3. Suppose the Verblunsky coefficients are independent and follow rota- 
tionally invariant laws. Given < s < 1 and \z\ = 1, 



E{||T(2,fc)|r} <exp 
for any k < I. More generally, 



l-l 

-fE 

j=k 



e{KI 2 } 



(68) 



(69) 
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for k < I < r. 

Proof. Let us first remark that (|68|l is the special case of 1(69(1 when k = I. We will 
prove a slightly more general result: 



E{|!T(r,0«iriM| s }<ex P -|^E{|a,| 2 } 



r-l 



3=1 



(70) 



where v is a random unit vector that is statistically independent of u.j for I < j < 
r — 1. Note that the result for general s will follow from s — 1 by Jensen's inequality. 

Clearly 1)70(1 is well formulated for proof by induction. When r — I one has 
equality. Assuming the result for T(r,V), we have 

^{WTir + ^lM-'Wvr} =E{\\A r T(r,l)v\\- 1 \\v\\ s } (71) 

<E{1- iKpjElllTMHriMI 3 } (72) 

by Lemma l5~2*l To finish the proof, notice that 1 — | < e~ x ^ for x £ [0, oo). □ 

Proposition 5.4. Fix < s < mm{^So,j}. With probability one, there exists a 
(random) unit vector v so that T(Z, k)v — > as I — * oo; moreover, there exists c > 
so £/iaf 



E{||r(Z,/c)v|| s } < (/c + l)exp[-c(r-/c e 
uniformly for < k < I and \z\ = 1. 



(73) 



Proof. We simply take expectations in the proof of the Osceledec Theorem. This 
theorem is covered in many textbooks, for example, |36l §10.5], which can be con- 
sulted for more detail on what follows. 

Let Pi denote the orthogonal projection onto the contracting subspace of T(l, k) £ 
SX(2,C); that is, the subspace where ||Tu|| = || T"!) _1 1| u|| . (This is one dimensional 
when llrll > 1.) Then 



\Pl+i-Pl\\ < 



\T(l,k)\\ 



(74) 



This is a ratio of independent random variables. As ||^4;|| 2 < 2(1 — |a;| 2 ) 1 , it 
follows from O that E{||A ; || 2s } < 1. (This implicit constant depends on s as do 
many more below.) Therefore, by applying Lemma |5.3I and l(63|l we see that there 
exists c > so that 



E{||P /+1 -P / || S } <exp 



j=k 



<exp[-c(r-fc e )] 



for any s < minjso, ^}. This implies that Pi+\ — Pi is almost surely summable and 
so Poo := lim^oo p exists. 
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Let v be a unit vector from the range of Poo- We must now study \\T(l, k)v\\. 
Using only elementary facts, we have 

E{||T(Z, k)v\\ s } = E{\\T(l, fe)P 0O || s } (75) 

OO 

<E{||T(;,fc)p|| s } + E E {ll r ( Z ' fc )ll s ll p ''+i- p ''H s } ( 76 ) 

r=l 
oo 

< e{\\t(i, k)\\- s } +J2mm, kw\\p r+1 p r \n. (??) 



r=l 



By Lemma f5. 31 and l|63l) . 

l-i 

' 4 



H\\T(l,k)\\- s } <exp 



<exp[-c(l e -k e )] 



for some c > 0. Next we deal with the sum: using (1741) . the Cauchy-Schwarz 
inequality, and the independence of the Verblunsky coefficients, we have 

E{||r(/,fc)|| s ||P r+1 -P r .|| s } 2 (78) 



The first factor is uniformly bounded by (|62|l . The second and third factors can be 
estimated via Lemma 15.31 In this way, we obtain 

E{||T(Z,fc)|| s ||P r+1 - P r j| s } < exp[ - c(2r e - l e - k f ')] 

for some c > 0. This is clearly summable in r; indeed 

oo 

^exp[-2cr e ] < (I + l) 1 '' exp[-2cZ £ ] . 

r=l 

Consequently, 

oo 

X>{||T(J, fc)||1|P r+ i - P.ir} < (/ + l)exp[ - cs(r - fc e )]. 

Collecting all the estimates we have derived, we obtain the existence of a c > 
so that 

E{\\T{l,k)v\\ s } < (Z + l)cxp[-c(r-fc £ )]. 

By reducing c a little bit, one may replace the polynomial factor by (k + 1) as 
written in the statement. □ 

It is presumably possible to obtain much more precise estimates for the decay of 
\\T(l, k)v\\ using the methods from [251 ( c ^- 1'^ §12-7]). While their techniques are 
strong, they pursue only the almost sure behaviour, which is why we dragged the 
reader through the mud here. 

In order to proceed, we need to upgrade Proposition 15.41 to an estimate on the 
Green's function (i.e., the matrix elements of the resolvent), 

G kl {z) := (C-z) u \ 
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for 



Here 



1. From jS3 §4.4], we learn the following 

G k i(z) 



(2z) 1 xi{z)pk{z) k > I or k = I is odd 
(2z)^ 1 Xk(z)Tri(z) I > k or k = I is even. 



(80) 



P21 
^21 



z~ l T(2l + l,0) 



'F(z) + 1 




z 1 ir 2 i-i 


F(z) - 1 




P21-1 



z~ l T(2l,0) 



F(z) + 1 
F(z) - 1 



X2l 



l T(2l + l,0) 



and 



Z 1 X2l- 
X21-1 



l T(2l,0) 



where F(z) is the Caratheodory function given in (|lip. Note that ^(z) is the 
complex conjugate of x^(l/z); in particular, these two sequences have the same 
moduli when z lies on the unit circle. 

For \z\ = 1, F(z) is purely imaginary with probability one because C has purely 
point spectrum with probability one. (For the latter fact, see (20 Theorem 12.7.5] 
or combine the Simon- Wolff criterion, |361 Theorem 10.2.1], with Proposition 
This implies that Pk{z) and TTk{z) are complex conjugates for such z. 

The way we have introduced pk and TTk belies their true significance; they are 
the Weyl solutions — they are proportional to the decaying solutions (whenever such 
solutions exist). In this way we find that for \z\ = 1 and k < I, 







n{z) 


Pk(z) 




^k(z) 



\T(l,k)v\\ (81) 

with probability one, provided v is chosen as in Proposition l5.4l 

Corollary 5.5. Fix < s < 1. There exists c > so that 

E{\G kl (z)\ s + \G lk (z)\ s } < (fe + l)exp[-c(r-fc e )] (82) 

uniformly for < k < I and \z\ = 1. Moreover, the same estimate holds uniformly 
in \z\ < 1 for a reduced value of c. 

Proof. By employing Kolmogorov's theorem, |11|. as in ^2 Lemma 4.1.1], we see 
that 

E{\G k i(z)\ s } < 2 2 - s sec(s7r/2). (83) 

for any choice of k,l and \z\ < 1. This shows us that it is sufficient to prove the 
result for s small, since the general version can then be recovered by interpolation 
(or rather, by Holder's inequality). 

Let us begin with \z\ = 1. From the discussion preceding this corollary, we have 



Gm(z) 




G lk (z) 


Gkk(z) 




Gkk{z) 



= \\m,k)v\\ 



(84) 



and so by the Cauchy-Schwartz inequality, 

E{\G u (z)\ s + \G lk (z)\ s } <2E{\\T(l,k)v\\ 2s }E{\G k k(z)\ 2s }. (85) 

The first factor here was estimated in Proposition 15.41 while the second can be 
bounded using iJSSJ. This proves l|82l) . 

To extend the result to z € B, one may apply the argument from Theorem 4.2 







□ 
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Corollary 5.6. Fix < s < 1. There exists c > and A > so that 

E{|(C (n) -z) a l \ s ) < Anexp[- cn e ~ l \l - k\] (86) 
for all \z\ < 1 and n. 

Proof. This result for finite matrices follows from the previous corollary by applying 
the resolvent identity as in Lemma 4.2.3 of 01]. To make the final result look 
cleaner, we employed the estimate 

\l e - k e \ > en^lk - l\, VMe[0,n], 

which follows from the lower bound on the derivative of x i— » x e . □ 

With Lemma 15.11 and Corollary 15.61 as input, one may now simply repeat the 
analysis of [2H| to prove part (iii) of Theorem 11.71 This input is also sufficient to 
prove Poisson statistics for the locations of the eigenfunctions as in [21]. 
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